# 1K image processing
Sapiens Seg 0.6b Bfloat16
Sapiens is a family of Vision Transformer models pre-trained on 300 million 1024x1024 resolution human images, focusing on human-centric vision tasks.
Image Segmentation English
S
facebook
24
0
Sapiens Pose 1b Bfloat16
Sapiens is a vision transformer series model pre-trained on 300 million 1024x1024 resolution human images, focusing on human-centric vision tasks.
Pose Estimation English
S
facebook
31
0
Sapiens Depth 2b
Sapiens is a family of vision Transformer models pre-trained on 300 million 1024×1024 resolution human images, focusing on human-centric vision tasks.
3D Vision English
S
facebook
40
3
Sapiens Seg 0.3b
Sapiens is a family of Vision Transformer models pre-trained on 300 million 1024×1024 resolution human images, focusing on human-centric vision tasks.
Image Segmentation English
S
facebook
48
2
Sapiens Pretrain 0.6b
Sapiens is a Vision Transformer model pre-trained on 300 million 1024×1024 resolution human images, excelling in human-centric vision tasks.
Image Classification English
S
facebook
13
0
Sapiens Pretrain 1b
Sapiens is a vision Transformer model pretrained on 300 million high-resolution human images, focusing on human-centric vision tasks.
Face-related English
S
facebook
48
1
Sapiens Seg 1b Torchscript
Sapiens is a series of vision transformers pre-trained on 300 million 1024×1024 resolution human images, specifically designed for human-centric vision tasks with exceptional generalization capabilities.
Image Segmentation English
S
facebook
892
1
Sapiens Pose 1b Torchscript
Sapiens is a vision Transformer model pre-trained on 300 million 1024x1024 resolution human images, specifically designed for high-precision pose estimation tasks.
Pose Estimation English
S
facebook
1,245
7
Featured Recommended AI Models